A functional articulatory dynamic model for speech production
نویسندگان
چکیده
This paper introduced a new speech production model aiming at synthesizing natural speech in real-time by modeling the key dynamic properties of the articulators in a nonlinear state-space framework. The goal-oriented movement of the tongue tip, tongue dorsum, upper lip, lower lip and jaw are described in a linear state equation. The so produced articulatory trajectories combined with the effects of velum and larynx are mapped into acoustic features in the nonlinear observation equation. The input and output of the model are time-aligned phone sequence and speech waveform respectively. This speech production model can also be directly applied to speech recognition to better account for coarticulation and phonetic reduction phenomenon with considerably less parameters than the traditional HMM based approaches.
منابع مشابه
Acoustic-to-articulatory inversion using a speaker-normalized HMM-based speech production model
Acoustic-to-articulatory inverse mapping is a difficult problem because of its non-linear and oneto-many characteristics. We have previously developed a speech inversion method using a hidden Markov model (HMM)-based speech production model which takes into account the phonemespecific dynamic constraints of articulatory parameters. We found that the constraint significantly decreases the estima...
متن کاملArticulatory Features and Associated Production Models in Statistical Speech Recognition
A statistical approach to speech recognition is outlined which draws close parallel with closed-loop human speech communication schematized as a joint process of encoding and decoding of linguistic messages. The encoder consists of the symbolically-valued overlapping articulatory feature model and of its interface to a nonlinear task-dynamic model of speech production. A general speech recogniz...
متن کاملMethod for Speech Inversion with Large Scale Statistical Evaluation
An articulatory model of speech production is created for the purpose of studying the links between speech production and perception. A computationally effective method for speech inversion in proposed, using a two-pole predictor structure in order to maintain better articulatory dynamics when compared to conventional dynamic programming methods. Preliminary tests for the effect of inversion ar...
متن کاملProduction-Oriented Models for Speech Recognition
Acoustic modeling in speech recognition uses very little knowledge of the speech production process. At many levels our models continue to model speech as a surface phenomenon. Typically, hidden Markov model (HMM) parameters operate primarily in the acoustic space or in a linear transformation thereof; state-to-state evolution is modeled only crudely, with no explicit relationship between state...
متن کاملBOSTON UNIVERSITY GRADUATE SCHOOL OF ARTS AND SCIENCES Dissertation AN INVESTIGATION OF ARTICULATORY-ACOUSTIC RELATIONSHIPS IN SPEECH PRODUCTION by
This thesis is a combination of empirical and modeling work concerning articulatory-acoustic relationships in speech production. The empirical work investigates the functional relationship between articulatory variability and stability of acoustic cues during American English /r/ production. The analysis of articulatory movements shows that the extent of intra-subject articulatory variability a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001